Experiences with Co-array Fortran on Hardware Shared Memory Platforms
Abstract
When performing source-to-source compilation of Co-array Fortran (CAF) programs into SPMD Fortran 90 codes for shared-memory multiprocessors, there are several ways of representing and manipulating data at the Fortran 90 language level. We describe a set of implementation alternatives and evaluate their performance implications for CAF variants of the STREAM, Random Access, Spark98 and NAS MG & SP benchmarks. We compare the performance of library-based implementations of one-sided communication with fine-grain communication that accesses remote data using load and store operations. Our experiments show that using application-level loads and stores for fine-grain communication can improve performance by as much as a factor of 24; however, codes requiring only coarse-grain communication can achieve better performance by using an architecture’s tuned memcpy for bulk data movement.
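As a rough illustration of the two data-movement styles the abstract contrasts, the C sketch below copies an array once with per-element loads and stores and once with a single bulk memcpy. It is only an analogy under stated assumptions: it is not the paper's CAF source or its generated Fortran 90, the function names copy_load_store and copy_bulk are hypothetical, and both buffers live in one process, so it shows the shape of each approach rather than any performance difference.

```c
#include <stddef.h>
#include <string.h>

/* Fine-grain style (hypothetical helper): every element is moved with an
 * ordinary load from src followed by an ordinary store to dst. */
static void copy_load_store(double *dst, const double *src, size_t n) {
    for (size_t i = 0; i < n; i++) {
        dst[i] = src[i];
    }
}

/* Coarse-grain style (hypothetical helper): the whole block is moved in one
 * call to the platform's memcpy. */
static void copy_bulk(double *dst, const double *src, size_t n) {
    memcpy(dst, src, n * sizeof *src);
}
```

Both helpers leave dst with the same contents; which one is faster depends on the transfer size and the platform, which is the trade-off the abstract reports.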
Related resources
A Data Structure Oriented Monitoring Environment for Fortran OpenMP Programs
This paper describes a monitoring environment that enables the analysis of memory access behavior of applications in a selective way with a potentially very high degree of detail. It is based on a novel hardware monitor design that employs an associative counter array to measure data structure related information at runtime. A simulator for this hardware monitor is implemented, providing the ca...
Diagnosing Network Bottlenecks: One-sided Message Contention
Two trends suggest that one-sided message network contention is poised to become a cause of concern for scientific application developers. First, there is an increased interest in one-sided messages motivated by Global Address Space (GAS) programming models such as Unified Parallel C (UPC) [1], Co-Array Fortran (CAF) [2], [3], Global Arrays [4], and Chapel [5]. The GAS programming model provide...
Interface Synthesis using Memory Mapping for an FPGA Platform
Several system-on-chip (SoC) platforms have recently emerged that use reconfigurable logic (FPGAs) as a programmable co-processor to reduce the computational load on the main processor core. We present an interface synthesis approach that enables us to do hardware-software codesign for such FPGA-based platforms. The approach is based on a novel memory mapping algorithm that maps data used by bo...
Co-Array Fortran Experiences with Finite Differencing Methods
Partial differential equations are used to describe physical phenomena in many science and engineering fields. Finite differencing methods map these continuous equations into discrete space so that they may be solved on computers. Co-Array Fortran provides the means for implementing such solution methods in parallel processing computing environments. In this report we describe our experiences w...
Efficient Parallelization of Unstructured Reductions on Shared Memory Parallel Architectures
This paper presents a new parallelization method for an efficient implementation of unstructured array reductions on shared memory parallel machines with OpenMP. This method is strongly related to parallelization techniques for irregular reductions on distributed memory machines as employed in the context of High Performance Fortran. By exploiting data locality, synchronization is minimized witho...
Publication date: 2004